Unsupervised learning human's activities by overexpressed recognized non-speech sounds

نویسندگان

Serge Smidtas

Magalie Peyrot

چکیده

Human activity and environment produces sounds such as, at home, the noise produced by water, cough, or television. These sounds can be used to determine the activity in the environment. The objective is to monitor a person’s activity or determine his environment using a single low cost microphone by sound analysis. The purpose is to adapt programs to the activity or environment or detect abnormal situations. Some patterns of over expressed repeatedly in the sequences of recognized sounds inter and intra environment allow to characterize activities such as the entrance of a person in the house, or a tv program watched. We first manually annotated 1500 sounds of daily life activity of old persons living at home recognized sounds. Then we inferred an ontology and enriched the database of annotation with a crowed sourced manual annotation of 7500 sounds to help with the annotation of the most frequent sounds. Using learning sound algorithms, we defined 50 types of the most frequent sounds. We used this set of recognizable sounds as a base to tag sounds and put tags on them. By using over expressed number of motifs of sequences of the tags, we were able to categorize using only a single low-cost microphone, complex activities of daily life of a persona at home as watching TV, entrance in the apartment of a person, or phone conversation including detecting unknown activities as repeated tasks performed by users.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Percussive/harmonic sound separation by non-negative matrix factorization with smoothness/sparseness constraints

In this paper, unsupervised learning is used to separate percussive and harmonic sounds frommonaural non-vocal polyphonic signals. Our algorithm is based on a modified non-negative matrix factorization (NMF) procedure that no labeled data is required to distinguish between percussive and harmonic bases because information from percussive and harmonic sounds is integrated into the decomposition ...

متن کامل

Non-negative tensor factorisation of modulation spectrograms for monaural sound source separation

This paper proposes an algorithm for separating monaural audio signals by non-negative tensor factorisation of modulation spectrograms. The modulation spectrogram is able to represent redundant patterns across frequency with similar features, and the tensor factorisation is able to isolate these patterns in an unsupervised way. The method overcomes the limitation of conventional non-negative ma...

متن کامل

Parametrisation of the speech space using the self-organising neural network

Speech recognition is a diicult problem due to the inability of current systems to cope with connected speech. Neural networks are able to learn some aspects of this task. An unsupervised learning scheme like the self-organising map can be used to both classify and order the speech sounds and provide a front end to higher level processing. A map of phonemes (phonotopic map) is used to trace tra...

متن کامل

Joint Word Segmentation and Phonetic Category Induction

We describe a model which jointly performs word segmentation and induces vowel categories from formant values. Vowel induction performance improves slightly over a baseline model which does not segment; segmentation performance decreases slightly from a baseline using entirely symbolic input. Our high joint performance in this idealized setting implies that problems in unsupervised speech recog...

متن کامل

Cue Integration With Categories: Weighting Acoustic Cues in Speech Using Unsupervised Learning and Distributional Statistics

During speech perception, listeners make judgments about the phonological category of sounds by taking advantage of multiple acoustic cues for each phonological contrast. Perceptual experiments have shown that listeners weight these cues differently. How do listeners weight and combine acoustic cues to arrive at an overall estimate of the category for a speech sound? Here, we present several si...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1311.1935 شماره

صفحات -

تاریخ انتشار 2013

Unsupervised learning human's activities by overexpressed recognized non-speech sounds

نویسندگان

چکیده

منابع مشابه

Percussive/harmonic sound separation by non-negative matrix factorization with smoothness/sparseness constraints

Non-negative tensor factorisation of modulation spectrograms for monaural sound source separation

Parametrisation of the speech space using the self-organising neural network

Joint Word Segmentation and Phonetic Category Induction

Cue Integration With Categories: Weighting Acoustic Cues in Speech Using Unsupervised Learning and Distributional Statistics

عنوان ژورنال:

اشتراک گذاری